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Production of antibodies using gene libraries* 



Description 

Background cf th e Invention 

5 Monoclonal and polyclonal antibodies are useful for 

a variety of purposes. The precise antigen specificity 
of antibodies makes them powerful tools that can be used 
for the detection, quantitation, purification and 
neutralization of antigens. 

10 Polyclonal antibodies are produced in v ivo by 

immunizing animals, such as rabbits and goats, vith 
antigens, bleeding the animals and isolating polyclonal 
antibody molecules from the blood. Monoclonal antibodies 
are produced by hybridoma cells, which are made by 

15 fusing, in vitro , immortal plasmacytoma cells with 

antibody producing cells (Kohler, G. and C. Milstein, 
Nature, 256:495 (1975)) obtained from animals immunized 
in vivo with antigen. 

Current methods for producing polyclonal and mono- 

20 clonal antibodies are limited by several factors. First, 
methods for producing either polyclonal or monoclonal 
antibodies require an in vivo immunization step. This 
can be time consuming and require large amounts of 
antigen. Second, the repertoire of antibodies expressed 

25 in vivo is restricted by physiological processes, such as 
those which mediate self -tolerance that disable auto- 
reactive B cells (Goodnow, C.C., et, al . , N atu re , 334:676 
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(19 88); Goodnow, J.V., Basic and Clinical Immunology . Ed. 
5, Los Altos, CA, Large Redical Publications (1984); 
Young, C.R., Molec ular Immu nology * New York, Marcel 
Dekker (1984)). Third, although antibodies can exist in 

05 millions of different forms, each vith its own unique 
binding site for antigen, antibody diversity is 
restricted by genetic aechanisms for generating antibody 
diversity (Honjo, T. , Ann. Re? A Jlgmunol^. 1:499 (1983); 
Tonegawa, S. f Na ture:302 : 575 (1983)). Fourth, not ail 

10 the antibody molecules which can be generated will be 

generated in a given animal. As a result, raising high 
affinity antibodies to a given antigen can be very time 
consuming and can often fail. Fifth, the production of 
human antibodies of desired specificity is very 

15 problematical. 

A method of producing antibodies which avoids the 
limitations of presently -available methods, such as the 
requirement for immunization of an animal and in vivo 
steps, would be very useful, particularly if it made it 

20 possible to produce a wider range of antibody types than 
can be made using presently- available techniques and if 
it made it possible to produce human antibody types. 

Disclos ure o f the Inv ention 

The present invention relates to a method of produc- 

25 ing libraries of genes encoding antigen-combining 

molecules or antibodies; a method of producing antigen- 
combining molecules, also referred to as antibodies, 
which does not require an in vivo procedure, as is 
required by presently-available methods; a method of 

30 obtaining antigen-combining molecules (antibodies) of 
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selected or defined specificity which does not require an . 
in vivo procedure; vectors useful in the present method 
and antibodies produced or obtained by the method. 

The present invention relates to an in vitro process 

05 for synthesizing DNA encoding families of antigen- 
combining molecules or proteins. In this process, DNA 
containing genes encoding antigen- combining molecules is 
obtained and combined with oligonucleotides which are 
homologous to regions of the genes which are conserved. 

10 Sequence-specific gene amplification is then carried out 
using the DNA containing genes encoding antigen-combining 
proteins as template and the homologous oligonucleotides 
as primers. 

This invention also relates to a method of creating 

15 diverse libraries of DNAs encoding families of antigen- 
combining proteins by cloning the product of the in_yi_tro 
process for synthesizing DNA , described in the preceeding 
paragraph, into an appropriate vector (e.g., a plasmid, 
viral or retroviral vector). 

20 The subject invention provides an alternative method 

for the production of antigen- combining molecules, which 
are useful affinity reagents for the detection and 
neutralisation of antigens and the delivery of molecules 
to antigenic sites. The claimed method differs from 

25 production of polyclonal antibody molecules derived by 

immunization of live animals and from production of mono- 
clonal antibody molecules through the use of hybridoma 
cell lines in that it does not require an in vivo 
immunization step, as do presently available methods. 

30 Rather, diverse libraries of genes which encode antigen- 
combining sites comprising a significant proportion of an 
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animal's repertoire of antibody combining sites are made, 
as described in detail herein. These genes are expressed 
in living cells, from which molecules of desired 
antigenic selectivity can be isolated and purified for 
05 various uses. 

Antigen-combining molecules are produced by the 
present method in the following manner, which is 
described in greater deail below. Initially, a library 
of antibody genes which includes a set of variable 
10 regions encoding a large, diverse and random group of 
specificities derived from animal or human immunoglob- 
ulins is produced by amplifying or cloning diverse 
genomic fragments or cDNAs of antibody mRNAs found in 
antibody-producing tissue. 
15 I* an optional step, the diversity of the resulting 

libraries can be increased by means of random muta- 
genesis. The gene libraries are introduced into cultured 
host cells, which may be eukaryotic or prokaryotic, in 
which they are expressed. Genes encoding antibodies of 

20 desired antigenic specificity are identified, using a 

method described herein or known techniques, isolated and 
expressed in quantities in appropriate host cells, from 
which the encoded antibody can be purified. 

Specifically, a library of genes encoding 

25 immunoglobulin heavy chain regions and a library of genes 
encoding immunoglobulin light chain regions are con- 
structed. This is carried out by obtaining antibody- 
encoding DNA * which is either genomic fragments or cDNAs 
of antibody mRNAs, amplfying or cloning the fragments or 

30 cDNAs; and introducing them into a standard framework 
antibody gene vector, which is used to introduce the 
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antibody-encoding DNA into cells in which the DNA is 
expressed* The vector includes a framework gene encoding 
a protein, such as a gene encoding an antibody heavy 
chain or an antibody light chain which can be of any 

05 origin (human, non-human) and can be derived from any of 
a number of existing DNAs encoding heavy chain immuno- 
globulins or light chain immunoglobulins. Such vectors 
are also a subject of the present invention and are 
described in greater detail in a subsequent section. 

10 Genes from one or both of the libraries are introduced 
into appropriate host cells, in which the genes are 
expressed, resulting in production of a wide variety of 
antigen-combining molecules. 

Genes encoding antigen-combining molecules of 

15 desired specificity are identified by identifying cells 
producing antigen-combining molecules which react with a 
selected antigen and then obtaining the genes of 
interest. The genes of interest can subsequently be 
introduced into an appropriate host cell (or can be 

20 further modified and then introduced into an appropriate 
host cell) for further production of antigen- combining 
molecules, which can be purified and used for the same 
purposes- for which conventionally-produced antibodies are 
used. 

25 Through use of the method described, it is possible 

to produce antigen-combining molecules which are of wider 
diversity than are antibodies available as a result of 
known methods; novel antigen-combining molecules with a 
diverse range of specificities and affinities and 

30 antigen-combining molecules which are predominantly human 
in origin. Such antigen-combining molecules are a 
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subject of the present invention and can be used 
clinically for diagnostic, therapeutic and prophylactic 
purposes, as well as in research contexts, and for other 
purposes . 

Brief Description of the Drawings 

Figure 1 is a schematic representation of the method 
of the present invention by which antigen- combining 
molecules, or antibodies, are produced. 

Figure 2 is a schematic representation of amplifica- 
tion or cloning of IgM heavy chain variable region DNA 
from mRNA , using the polymerase chain reaction, 
ZgSSl-A shows the relevant regions of the poly adenylated 
mRNA encoding the secreted form of the IgM heavy chain. 
S denotes the sequences encoding the signal peptide which 
causes the nascent peptide to cross the plasma membrane. 
V, D and J together comprise the variable region. C„l, 
C H 2, and C fi 3 are the three constant domains of C/i. Hinge 
encodes the hinge region. C, B and Z are oligonucleotide 
PCR primers (discussed below). 

20 Z£B£i_B shows the reverse transcript DNA product of the 
mRNA prfmed by oligonucleotide 2, with the addition of 
poly-dC by terminal transferase at the 3' end. 
f * ne l-g is a schematic representation of the annealing of 
primer A to the reverse transcript DNA. 

25 Panel D shows the final double stranded DNA PCR product 
made utilizing primers A and B. 

shows the product of PCR annealed to primer C. 
ZS&Bl.* is a blowup of Panel E, showing in greater detail 
the structure of primer C. Primer C consists of two 
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parts: a 3' part complementary to IgM heavy chain mRNA 
as shown, and a 5' part which contains restriction site 
RE2 and spacer. 

lanel_G shows the final double stranded DNA PCR product 
made utilizing primers A and C and the product of the 
previous PCR (depicted in D) as template. The S , V, D. J 
regions are again depicted. 

Figure 3 is a schematic representation of the heavy 
chain framework vector pFHC. The circular plasmid 
(above) is depicted linearized (below) and its. relevant 
components are shown: animal cell antibiotic resistance 
marker; bacterial replication origin; bacterial cell 
antibiotic resistance marker; Cp enhancer; LTR containing 
the viral promoter from the Moloney MLV retrovirus DNA; 
PCR primer (D) ; cDNA cloning site containing restriction 
endonuclease sites, RE1 and RE2 , separated by spacer DNA; 
C/i exons; and poly A addition and termination sequences 
derived from the C„ gene or having the same sequence as 
the Cft gene . 

Figure 4 depicts a nucleotide sequence of the C 1 
exon of the Cft gene, and its encoded amino acid sequence 
(Panel A) The nucleotide coordinate numbers are listed 
above the 'line of nucleotide sequences. Panel B depicts 
the N-doped sequence, as defined in the text. 



25 Petailed_Descriptlon of the Inv>nf< 



on 



The present invention provides a method of producing 
antigen- combining molecules (or antibodies) which does 
not require an in_vivo immunization procedure and which 
makes it possible to produce antigen- combining molecules 
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prod '"I 41Ve "' ty 11 th °»" * ««»•«.. 

produced by curr.ntly-.v.u.Ho t.ehnl,u.. 

05 (Mtlbody P r.:.::: ) "::r:L:::. i,en - co - M '' 1 - 

antigen- combining specificities- - m »r*~* * 

e„„v a method of producine 

such antigen-combining molecules 

» 0 i*,.„i ecules - ai >tigen-combining 

molecules produced bv the Bll1 .i,^ „ 

. method and vectors useful in 

the method. The following is a descri Dti / 
10 of such libraries of the des «*Ption of generation 

Present method of producing 
antigen.combining molecules of selected specificity f„d 
o vectors useful in producing antigen-colin " g 
molecules of the present invention 
■ As described below, the process makes use of 
15 techniques which are known to those of skill in t v 

- can be applied as described herein to produce Ld"' 

-PH. and'clo £ ZlZ^T ^ ' '° 

- ,ound in «tibod y . pre du^ts^ 
to fu rther increase the diversity of 

K£j::rr antiwy — — 

VproKaryotic and eukaryotic) cells for the 
Pur po of expressing them; and screening protocol to 
25 detect genes encoding antibodies of the desired \\ 

specificity. A general outline of th "tigenic 
represented in Figure 1 " - th » d is 
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Constructio n of Libra ry of Genes Encoding 
Antigen-Combining Molecules 

A key step in the production of antigen-combining 
molecules by the present method is the construction of a 
-library- of antibody genes which include -variable" 
regions encoding a large, diverse, but random set of 
specificities. The library can be of human or non-human 
origin and is constructed as follows: 

Initially, genomic DNA encoding antibodies or cDNAs 
of antibody mRNA (referred to as antibody-encoding DNA) 
is obtained. This DNA can be obtained from any source of 
antibodyrproducing cells, such as spleen cells, 
peripheral blood cells. ly np h nodes, inflammatory tissue 
cells and bone marrow cells. It can also be obtained 
from a genomic library or cDNA library of B cells. The 
antibody-producing cells can be of human or non-human 
origin; genomic DNA or mRNA can be obtained directly from 
the tissue (i.e., without previous treatment to remove 
cells which do not produce antibody) or can be obtained 
after the tissue has been treated to increase 
concentration of antibody-producing cells or to select a 
particular type(s) of antibody-producing cells (i.e.. 
treated to^ enrich the content of antibody-producing 
cells). Antibody-producing cells can be stimulated by an 
agent which stimulates antibody mRNA production (e.g., 
lipopolysaccharide) before DNA is obtained. 

Antibody-encoding DNA is amplified and cloned using 
a known technique, such as the PGR using appropriately- 
selected primers, in order to produce sufficient quanti- 
ties of the DNA and to modify the DNA in such a manner 
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(..«.. by addition of . ppropri „ te restrictloa 

« r " " Ch " '""body g .ne 

further dtv.r.ifi.d. oslng . at . i . M , I . ^ 
order to produce e gr .. t . r dIverslty „ ^ 

10 1 'T^"" 8 •° 1 " U1 » » —1 .„ti g .» 

10 bindxng molecules. 

erureLT" f iotroduc.d Into .» 

v " ° f P """ Mention, »hich c.n b. . pliMld , 

:::: ic 1 »«- *»« «*. ..,»..!..': : r 

used .eke it p... IM . £or th . expressed 

•s . protein in th. ho.t cell 0 „. „ , "Pressed 

!0 useful i„ tk 'xpr.ssi.n rector 

useful in the_ p r.,.,t „. thod lB ref>rrtd co ^ 

fr..e.orx .ntihody g .„. vector. Vector, useful In the 
Present ..thod cont.in .ntihody c.n.t.nt r. gl on 

zzzxrz :.\r.: P :::::: rr he ; — ~ 

5 co.pn.in. . ..riehi. r. g lon ^.'^JZ^T* 
proper r. g i.t.r. The t.o r. g ion. pre.ent i, 

P«duct c,» he iron th. .... type of ...uoo^obuliT 

.oi:::;:." ro * tvo di£f — - *— 

Us> which ca * be eukaryotic or 
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prokaryotic. The libraries can be introduced into host 
cells separately or together. Introduction of the 
antibody-encoding DNA in vitro into host cells (by 
infection, transformation or transf ection) is carried out 
using known techniques, such as electroporation, 
protoplast fusion or calcium phosphate co -precipitation . 
If only one library is introduced into a host cell, the 
host cell will generally be one which makes the other 
antibody chain, thus making it possible to produce 
complete/functional antigen-binding molecules. For 
example, if a heavy chain library produced by the present 
method is introduced into host cells, the host cells will 
generally be cultured cells, such as myeloma cells or E^ 
coli, which naturally produce the other (i.e., light) 
chain of the immunoglobulin or are engineered to do so. 
Alternatively, both libraries can be introduced into 
appropriate host cells, either simultaneously or 
sequentially. 

Host cells in which the antibody-encoding DNA is 
expressed can be eukaryotic or prokaryotic. They can be 
immortalized cultured animal cells, such as a myeloma 
cell line which has been shown to efficiently express and 
secrete introduced immunoglobulin genes (Morrison. S.L., 
£!_£!•. Afln^HJ^Acad^Sci^, 507:187 (1987); Kohler, G . 
and C. Milstein. Eur, J. Immunol.. 6:511 (1976); Oi, 
V.T., et_al. , Immunoglobulin Gene Ei t pr>. K ^n^_ ; u 

f o rme d_Lym£h o id_Ce 1 1 s , 80:825 (1983); Davis, A.C. 
and M.J. Shulman. Immunol. Today. 10:119 (1989)). One 
host cell which can be used to express the antibody- 
encoding DNA is the J558L cell line or the SP2/0 cell 
line. 
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Cells expressing antigen. combining molecules with a 

desired specificity for a given antigen can then be 

selected by a variety of means, such as testing for 

reactivity with a selected antigen using nitrocellulose 

layering. The antibodies identified thereby can be of 

human origin, nonhuman origin or a combination of both 

That is, all or some of the components (e.g., heavy 

chain, light chain, variable regions, constant regions) 

can be encoded by 1>KA of human or nonhuman origin, which 

when expressed produces the encoded chimeric protein 

vhich, in turn, may be human, nonhuman or a combination 

of both. m such antigen- combining molecules, all or 

some of the regions (e.g., heavy and light chain variable 
and constant regions) are ^^^^ ^ ^ ^ ^ 

origin or of nonhuman origin, based on the source of the 

encodi »S the antigen-combining molecule region in 
question. For example, in the case in which DNA encoding 
»ouse heavy chain variable region is expressed in host 
cells, the resulting antigen-combining molecule has a 
heavy chain variable region of mouse origin. Antibodies 
Produced may be used for such purposes as drug delivery 
tumor imaging and other therapeutic, diagnostic and 
prophylactic uses. 

cbt.in.d. ti . lr gen „ „. y be t . oUt>4 m4 further 
.nt.g.niz.d t. create .ddi.io.el .ntig.n combining 
dlvereiry or entihodi.e of higher .m,i ty £ or entig.n , 
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Const ruc^on_of _lgmuno £ lob ul in Heavy Chain r. tii,^ 

and_Produc^n^Oncode A^tl g e 0 -Bin din R Hoi ..,,7.1 

The following is a detailed descriptioTof a" 
Specific experimental protocol which embodies the 
concepts described above. Although the following is a 
description of one particular embodiment, the same 
procedures can be used to produce libraries in which the 
immunoglobulin and the heavy chain class are different or 
in which light chain genes are amplified and cloned. The 
present invention is not intended to be limited to this 
example. In the embodiment presented below, a diverse 
heavy chain gene library is constructed. Using the 
principles described in relation to the heavy chain gene 
library, a diverse light chain gene library is also 
15 constructed. These are co-expressed in an immortal tumor 
cell capable of producing antibodies, such as plasma- 
cytoma cells or myeloma cells. Cells expressing antibody 
reactive to antigen are identified by a nitrocellulose 
filter overlay and antibody is prepared from cells 
identified as expressing it. As described in a subse- 
quent section, there are alternative methods of library 
construction, other expression systems which can be used, 
and alternative selection systems for identifying anti- 
body -producing cells or viruses. 

Step 1 in this specific protocol is construction of 
libraries of genes in E . coli which encode immunoglobulin 
heavy chains. This is followed by the use of random 
mutagenesis to increase the diversity of the library, 
which is an optional procedure. Step 2 is introduction 
of the library, by transfection, into myeloma cells. 
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Step 3 is identification of myeloma cells expressing 
antibody with the desired specificity, using the 
nitrocellulose filter overlay technique or techniques 
known to those of skill in the art. Step 4 is isolation 
of the gene(s) encoding the antibody with the desired 
specificity and their expression in appropriate host 
cells, to produce antigen- combining fragments useful for 
a variety of purposes. 

.Constructio n 

One key step i„ construction of the library of cDNAs 
encoding the variable region of mouse heavy chain genes 
is construction of an E^coli* plasmid vector, designated 
PFHC. pFHC contains a -framework" gene, which can be 
any antibody heavy chain and serves as a site into which 
the amplified cloned gene product (genomic DNA or cDNA of 
antibody mRNAs) is introduced. pFHC is useful as a 
vector for this purpose because it contains RE1 and RE2 
cloning sites. Other vectors which include a framework 
gene and other cloning sites can be used for this purpose 
as well. The framework gene includes a transcriptional 
promoter (e.g., a powerful promoter, such as a Moloney 
LTR (Mulligan. R.C., Injx^rimenta^^ 

^Bression. Kew York Adacemic Press, p. 155 (1983)) and"! 
C/i chain transcriptional enhancer to increase the level 
of transcriptions from the promoter (Gillies, s D et 
Si-. Cell, 33:717 (1983), a cloning site containing R^l 
and RE2; part of the C„ heavy chain gene encoding 
secreted protein; and poly A addition and termination 
sequences (Figure 3). The framework antibody gene vector 
of the present invention (pFHC) also includes a 
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selectable marker (e.g., an antibiotic resistance gene 
such as the neomycin resistance gene, neo R ) for animal 
cells; sequences for bacterial replication (ori); and a 
selectable marker (e.g., the ampicillin resistance gene, 
Amp ) for bacterial cells. The framework gene can be of 
any origin (human, non - human ) , and can derive from any 
one of « number of existing DNAs encoding heavy chain 
immunoglobulins (Tucker, P.W., et al . . Science. 206:1299 
(1979); Honjo, T. , et_al. , Cell, 18:559 (1979); Bothwell, 
A.L.M., et_al., Cell, 24:625 (1981); Liu, A.Y, et al. . 
Gene. 54:33 (1987); Kawakami, T. , et al.. Hue.. Acids. 
Res^, 8:3933 (1980)). In this embodiment, thl vector 
retains the introns between the Cgl. hinge, C H 2 and C H 3 
exons. The "variable region" of the gene, which includes 
the V, D and J regions of the antibody heavy chain and 
which encodes the antigen binding site, is deleted and 
replaced with two consecutive restriction endonuclease 
cloning sites, RE1 and RE2. The restriction endonuclease 
site RE1 occurs Just 3' to the LTR promoter and the 
restriction endonuclease site RE2 occurs within the 
constant region Just 3' to the J region (see Figure 3). 

Another key step in the production of antigen- 
combining' molecules in this embodiment of the present 
invention is construction in an E. coli vector of a 
library of cDNAs encoding the variable region of mouse 
immunoglobulin genes. In this embodiment, the pFHC 
vector, which includes cloning sites designated RE1 and 
RE2, is used for cloning heavy chain variable regions, 
although any cloning vector with cloning sites having the 
same or similar characteristics (described below) can be 
used. Similarly, a light chain vector can be designed, 
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using the .b.v. d.serib.d procedures and procedures kno „„ 

to • person of ordinary .kill in the art. 

•In this embodiment n u n.<i>i.H.. 

l ' "on- immune mouse spleens are 

fro. the spleen or fro. ,pl.. n processed in .„ch a aanner 
that it i. enriched for resting B o.lls. Enrich.ent of 
tl..u. results m . nor. unifor. r.pr...„t.tio» of 
antibody diversity i. the starting n.terial. 
Lymphocyte, o.n he purified fro* spleen using fic.ll 
gradients <Boy„n, A., Sc ss d i _J i _ 2 £_ci lnlc . 1 It , vest 
J!." <»..>>. , cell. . r . ^l^J,, 
(e.g., I o.lls) by panning » lth anti-JgM coated dishes 
(Vys.Cci, L.J. and ,.L. Sato, Iroo^Katl^,, . ScI 

(197.,,. ....... ct ^T^-~ 7b - 

11.2 reoeptor hut resting g ..„. do „«. „ 
oan he separated yet further fro. aotiv.t.d ells by 
M»U.. Further purifio.tion by sit. f r.oti.n.ti.n .» . 
LeiI Sorter results in a fa4-ri v v~ 

resting B cells. * Population of 

Poly A+ -mRNA from total mouse spleen is prepared 
according to published methods (Sambrook, J. e t al 

^i^^nin^^^o^r^Manuai, 2d EdT^oid 

Spring Harbor laboratorv Pres. r.u <. , 

MM«i\ „ * rress, Cold Spring Harbor, HY 

(1969)). Production of antibody mRNA o.n £ ir . t be 

.tl.„l.t.d by lip.polys.oob.rid. (LPS, «nder.son. d.A.. 

*J*: i^^J^. 145:15!! „„,„ . Flrst 

cMA is pr.p„. 0 to this nRNA p.pul.ti.n using .s pri.er 

« eli,o.„ol..tide. Z, uhioh is o..pl...n t .r y to C„ in 

Pile \"T " " ^ T " lS Prl "" tS « «- 

« 6 ur. 2. First str.nd cDHA is then elongated by the 

t.r.in.l transferase r.aotion with dCTP to for. a poly do 
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tail (Sambrook, J., et al. . Molecular Cloning: A 

Laboratory Manual, 2d Ed., Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, NY (1989)). 

This DNA product is then used as template in a 
polymerase chain reaction (PCR) to amplify cDNAs encoding 
anfibody variable regions (Saiki, R.K. , et al., Science. 
239:487 (1988); Ohara, 0., et al .. Proc^Hat l. Acad, S c"i. 
USA. 86:5673 (1989)). Initially, PCR is carried out with 
two primers: primer A and primer B, as represented in 
10 Figure 2. Primer A contains the RE1 site at its 5» end, 
followed by poly dG . Primer B is complementary to the 
constant (C H 1) region of the Cn gene. 3' to the J region 
and 5' to primer 2 (see Figure 2). Primer B is 
complementary to all Cfi genes, which encode the heavy 
chain of molecules of the IgM class, the Ig class 
expressed by all B cell clones prior to class switching 
(Schimizu, A. and T. Honjo, Cell, 36:801-803 (1984)) and 
present in resting B cells. The resultant PCR product 
includes a significant proportion of cDNAs encompassing 
20 the various V R regions expressed as IgM in the mouse. 

(The use of other primers complementary to the cDNA genes 
encoding the constant regions of other immunoglobulin 
heavy chains can be used in parallel reactions to obtain 
the variable regions expressed on these molecules, but 
25 for simplicity these are not described). 

Hext, the product of the first PCR procedure is used 
again for PCR with primer A and primer C. Primer C, like 
primer B, is complementary to the C/* gene 3» to J and 
just 5' to primer B (see Figure 2). Primer C contains 
30 the RE2 site at its 5' end. The RE2 sequence is chosen 
in such a manner that when it is incorporated into the 
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*. kn .« 0 ; t • 0 " , :: t, : 1 e v 17 one resi ° n ° f 

SSi^SSA, .6:5673 i^V*" ^S^***!-^ 

are .elected so that when the PGR - * 
these sites the Product Is cloned into 

antibJ a " «d the cloned 

antibody ge „e fragaents are brought back into «. 
frame with respect to th e * Pr ° per 
15 present i» pFH c Tbis r 4 -~«^"» ««• 

-nigene vh ch lacL he il """^ °* * 

i.acKs the intron normallv v 
J and the C 1 x- «»axiy present between 

«e C H 1 region of c„ (See Figure 3) The^ 

«;::; u ~ :r;,:: - - 
• — - ^::xr:.:::-r:::- ;:;:r ■ - f ~ 

r. S io! P 1 " 0 :' Uy ' dlVe " lty ° f the .Wl-',.,i. M . 
«ohnl,«. vkn<>lm tt thM . of ^ ^ i 

li.itl , " g rC,t '"" ,er "»«ti.™ of 

Uniting nucleotide concentred,,,, c v 
*novn „ Increa „ t „. lnf " J t , r " t 1 °' , of f" Ch "'""ens .re 
«nd result ,„ „ "fidelity of th. poly.erlz.tlon 
■>« result in production of cut.nt products 

represented in Figures 2 and * „ , 

PPHC Just 5- to «. The PCR or d ' **" 

The PCR product, after cleavage 
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with RE1 and RE2, is recloned into the framework vector 
pFHC. To the extent that mutation affects codons of the 
antigen binding region, this procedure increases the 
diversity of the binding domains. For example, if the 

05 starter library has a complexity of 10 6 elements, and an 
average of one mutation is introduced per complementarity 
determining region, and it is assumed that the 
complementarity determining region is 40 amino acids in 
size and that any of six amino acid substitutions can 

10 occur at a mutated codon, the diversity of the library 

can be increased by a factor of about 40 x 6, or 240, for 

single amino acid changes and 240 x 240, or about 
4 

6 x 10 , for double amino acid changes, yielding a final 
diversity of approximately 10 11 . This is considered to 

15 be in the range of the diversity of antibodies which 

animals produce (Tonegawa , S., Nature, 302 : 575 (1983)). 
Even greater diversity can be generated by the random 
combination of H and L chains, the result of co-expres- 
sion in host cells (see below). It is, thus, theoreti- 

20 cally possible to generate a more diverse antibody 

library In vitro than can be generated in_y iyo . This 
library of genes is called the "high diversity" heavy 
chain library. It may be propagated indefinitely in 
££ii- A high diversity light chain library can be 

25 prepared similarly. 

The framework vector for the light chain library, 
designated pFLC, includes components similar to those in 
the vector for the heavy chain library: the enhancer, 
promoter, a bacterial selectable marker, an animal 

30 selectable marker, bacterial origin of replication and 
light chain exons encoding the constant regions. For 
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pFLC, the animal selectable marker should differ from the 
animal selectable in pFHC. For example, if pFHC contains 
neo , pFLC can contain Eco gpt. 

A light chain library, which contains diverse light 
chain fragments, is prepared as described above for 
construction of the heavy chain library. In constructing 
the light chain library, the primers used are different 
from those described above for heavy chain library 
construction. In this instance, the primers are 
complementary to light chain mRNA encoding constant 
regions. The framework vector contains the light chain 
constant region exons. 

In troduction of the Library o f_Immu no globulin Chair, ft— 
into__Immo rtalized Animal c^u 

The library of immunoglobulin chain genes produced 
as described is subsequently introduced into a line of 
immortalized cultured animal cells, referred to as the 
•host- cells, in which the genes in the library are 
expressed. Particularly useful for this purpose are 
plasmacytoma cell lines or myeloma cell lines which have 
been shown to efficiently express and secrete introduced 
immunoglobulin genes (Morrison, S.L., et al . , Ann. N.Y. 
^L-Sci,. 507:187 (1987); Kohler, G. and C. MilsteinT 
----- J - teHSo U . 6:511 (1976); Galfre and C. Milstein,' 
Methods. Enzym o l^, 73:3 (1981); Davis, A.C. and M.J. 
Shulman, Immunol . Today, 10:119 (1989)). For example, 
the J558L cell line can be cotransf ected using electro- 
poration or protoplast fusion (Morrison, S.L.. et al. , 
An n . N.Y. Acad.^Sci^, 507:187 (1987)) and transfected' 
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cells selected on the basis of auxotrophic markers 
. present on light and heavy chain libraries. 

As a result of cotransf ormation and selection for 
markers on both light chain and heavy chain vectors, most 

05 transformed host cells will express several copies of 
immunoglobulin heavy and light chains from the diverse 
library, and will express chimeric antibodies (antibodies 
encoded by ail or part of two or more genes) (Nisonoff , 
A • » et al . f In The A nt ibody Molecule t Academic Press, NY 

10 p. 238 (1975)). These chimeric antibodies are of two 
types: those in which one chain is encoded by a host 
cell gene and the other chain is encoded by an exogen- 
ously introduced antibody gene and those in which both 
the light and the heavy chain are encoded by an exogenous 

15 antibody gene. Both types of antibodies will be 

secreted. A library of cells producing antibodies of 
diverse specificities is produced as a result. The 
library of cells can be stored and maintained in- 
definitely by continuous culture and/or by freezing. A . 

20 virtually unlimited number of cells can be obtained by 
this process. 

Isolation^ of Cells_Prpducing m Ant igen- Binding Molecules ; of 
Selected Spec ificity 

25 Cells producing antigen-binding molecules of 

selected specificity (i.e., which bind to a selected 
antigen) can be identified and isolated using 
nitrocellulose filter layering or known techniques. The 
same methods employed to identify and isolate hybridoma 

30 cells producing a desired antibody can be used: cells 
are pooled and the supernatants tested for reactivity 
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with antigen (Harlow, E. and D. Lane, Antibodies: A 
Laboratory Kanual , Cold Spring Harbor Laboratory, N.Y. , 
p. 283 (1988). Subsequently, individual clones of cells 
are identified, using known techniques. A preferred 
method for identification and isolation of cells makes 
use of nitrocellulose filter overlays, which allow the 
screening of a large number of cells. Cells from the 
library of transfected myeloma cells are seeded in 10 cm 2 
Petri dishes in soft agar (Cook, W.D. and M.D. Scharff 
PNAS, 74:5687 (1977).; Paige, C.J., et al .. Method s in 
En zym o l,,, 150:257 (1987)) at a density of 10 4 colony 
forming units, and allowed to form small colonies 
(approximately 300 cells). A large number of dishes 
- (>100) may be so seeded. Cells are then overlayed with a 
thin film of agarose «lmm) and the agarose is allowed to 
harden. The agarose contains culture medium without 
serum. Nitrocellulose filters (or other protein-binding 
filters) are layered on top of the agarose, and the 
dishes are incubated overnight. During this time, 
antibodies secreted by the cells will diffuse through the 
agarose and adhere to the nitrocellulose filters. The 
nitrocellulose filters are keyed to the underlying plate 
and removed for processing. 

The method for processing nitrocellulose filters is 
identical to the methods used for Western blotting 
(Harlow, E. and D. Lane, Anti bodies: Labor atory 
Cold Spring Harbor, N.Y. , p. 283 (1988)). The antibody' 
molecules are adsorbed to the nitrocellulose filter. The 
filters, as prepared above, are then blocked. The 
desired antigen, for example, keyhole lymphet hemocyanin 
(KLH), which has been iodinated with radioactive 125 I, is 
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then applied in Western blotting buffers to the filters. 
(Other, non radiographic methods can be used for 
detection). After incubation, the filters are washed and 
dried and used to expose autoradiography film according 

05 to standard procedures. Where the filters have adsorbed 
antibody molecules which are capable of binding KLH , the 
autoradiography film will be exposed. Cells expressing 
the KLH reactive antibody can be identified by 
determining the location on the dish corresponding to an 

10 exposed filter; cells identified in this manner can be 
isolated using known techniques. Cells which are 
isolated from a region of the dish can then be 
rescreened, to insure the isolation of the clone of 
antigen-binding molecule-producing cells. 

15 Isolatio n of Genes Encoding., AntjLg en-Bi nding^Molecules of 
Selected Speci f iclty_and_ Purification of Encoded 
Antl pen-Bindin g Molecules 

The gene(s) encoding an antigen-binding molecule of 
selected specificity can be isolated. This can be 

20 carried out, for example, as follows: primers D and C 
(see Figures 2 and 3) are used in a polymerase chain 
reactionT to produce all the heavy chain variable region 
genes introduced into the candidate host cell from the 
library. These genes are cloned again in the framework 

25 vector pFHC at the RE1 and RE2 sites. Similarly, all the 
light chain regions introduced into the host cell from 
the library are cloned into the light chain vector, pFLC. 
Members of the family of vectors so obtained are then 
transformed pairwise into myeloma cells, which are tested 

30 for the ability to produce and secrete the antibody with 
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the desired selectivity. Purification of the antibody 
from these cells can then be accomplished using standard 
procedures (Johnstone, A. and R. Thorpe. Immunochem. in 
Practice, Blaekwell Scientific, Oxford, p. 27 (1982); 
05 Harlow, E. and D. Lane, Antibod ies: A Lab oratory Manual. 
Cold Spring Harbor Laboratory, N.Y. , p. 283 (1988)). 

Alteration of Affinity og^ntigen^Mnding_Molecules 
It is also possible to produce antigen-binding 
molecules whose affinity for a selected antigen is 
10 altered (e.g., different from the affinity of a 

corresponding antigen-binding molecule produced by the 
present method). This can be carried out, for example, 
to increase the affinity of an antigen-binding molecule 
by randomly mutagenizing the genes isolated as described 
15 above using previously-described mutagenesis methods. 
Alternatively, the variable region of antigen-binding 
molecule-encoding genes can be sequenced and site 
directed mutagenesis performed to mutate the comple- 
mentarity determining regions (CDR) (Rabat, E.A., 
20 ISSHSoI^, 141: S 25-36 (1988)). Both processes result in 
production of a sublibrary of genes which can be screened 
for antigen-binding molecules of higher affinity or of 
altered affinity after the genes are expressed in myeloma 
cells . 

25 Alternative Materials and Procedures for U se in the 
Present Method 

In addition to those described above for use in the 
method of the present invention, other materials (e.g., 
starting materials, primers) and procedures can be used 



WO 91/10737 



PCT/US91/00209 



-25- 

in carrying out the method. For example, use of PCR 
technology to clone a large collection of cDNA genes 
encoding variable regions of heavy chains has been 
described above. Although primers from the C/i class were 
05 described as being used in unidirectional nested PCR, the 
present invention is not limited to these conditions. 
For example, primers from any of the other heavy chain 

classes (C-f^* 67^, c ?2b * ^ a exam P^ e ) or f rom light 

chains can be used. Cp was described as of particular 

10 use because of the fact that the entire repertoire of 
heavy chain variable regions are initially expressed as 
IgM. Only following heavy-chain class switching are 
these variable regions expressed with a heavy chain of a 
different class (Shimizu, A. and T. Honjo, Cell , 

15 36:801-803 (1984)). In addition, the predominant 
population of B cells ip nonimmune spleen cells is 
IgM + -cells (Cooper, M.D. and P. Burrows, In 
Immunoglobulin Genes , Academic Press, N.Y. p. 1 (1989)). 
Although unidirectional nested PCR amplification is 

20 described above, other PCR procedures, as well as other 
DNA amplification techniques can be used to amplify DNA 
as needed in the present invention. For example, 
bidirectional PCR amplification of antibody variable 
regions can be carried out. This approach requires use 

25 of multiple degenerate 5 r primers (Orlandi, R. , et al. t 
Pr oc. Natl. Acad. Sci. USA, 86:3833 (1989); Sastry, L. , 
et_al . , Proc. Natl. Acad, , Sci^JPSA , 86:5728 (1989)). 
Bidirectional amplification may not pick up the same full 
diversity of genes as can be expected from unidirectional 

30 PCR. 
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In addition, methods of introducing further 
diversity into the antibody library other than the method 
for random mutagenesis utilizing PGR described above can 
be used. Other methods of random mutagenesis, such as 

05 that described by Sambrook, et al. (Sambrook, J . , et al . , 
Molecular Cloning: A Laboratory Manual . Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, N.Y. (19S9)) 
can be used, as can direct mutagenesis of the comple- 
mentarity determining regions (CDRs). 

10 Framework vectors other than one using a mouse 

heavy chain constant region, which contains the Cfi 
enhancer and introns and a viral promoter (described 
previously) can be used for . inserting the products of 
PGR. The vectors described were chosen for their 

15 subsequent use in the expression of the antibody genes, 

but any eukaryotic or prokaryotic cloning vector could be 
used to create a library of diverse cDNA genes encoding 
variable regions of antibody molecules. The inserts from 
this vector could be transferred to any number of 

20 expression vectors. For example, other framework vectors 
which include intronless genes can be constructed, as can 
other heavy chain constant regions. In addition to 
plasmid vectors, viral vectors or retroviral vectors can 
be used to introduce genes into myeloma cells. 

25 The source for -antibody molecule mRNAs can also be 

varied. Purified resting B lymphocytes from mouse 
nonimmunized spleen are described above as such a source. 
However, total spleens (immunized or not) from other 
animals, including humans, can be used, as can any source 
30 of antibody-producing cells (e.g., peripheral blood, 
lymph nodes, inflammatory tissue, bone marrow). 
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Introduction of H and L chain gene DNA into myeloma 
cells using cotransf ormation by electroporation or 
protoplast fusion methods is described above (Morrison, 
S.L. and V.T. 0i f Adv. Immunol. . 44:65 (1989)). However, 

05 any means by which DNA can be introduced into living 
cells in vivo can be used, provided that it does not 
significantly interfere, with the ability of the 
transformed cells to express the introduced DNA. In 
fact, a method other than cotransf ormation , can be used. 

10 Cotransf ection was chosen for its simplicity, and because 
both the H and L chains can be introduced into myeloma 
cells. It may be possible to introduce only the H chain 
into myeloma cells. Moreover, the H chain itself in many 
cases carries sufficient binding affinity for antigen. 

15 However, other methods can also be used. For example, 
retroviral infection may be used. Replication- incompe - 
tent retroviral vectors can be readily constructed which 
can be packaged into infective particles by helper cells 
(Mann, R. , et al,, Cell , 33:153-159 (1903)). Viral 

20 titers of 10^ infectious units per ml. can be achieved, 
making possible the transfer of very large numbers of 
genes, into myeloma cells. 

Further increases in the diversity of antibody- 
producing cells than results from the method described 

25 above can be generated if light and heavy chain genes are 
introduced separately into myeloma cells. Light chain 
genes can be introduced into one set of myeloma cells 
with one selectable marker, and heavy chains into another 
set of cells with a different selectable marker. Myeloma 

30 cells containing and expressing both H and L chains could 
then be generated by the highly efficient process of 
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polyethylene glycol mediated cell fusion (Ponteeorvo, G. f 
So matic Cell Genetics . 1:397 (1975)). Thus, a method of 
screening diverse libraries of antibody genes using 
animal cells is not limited by the number of cells which 
05 can be generated, but by the number of cells which can be 
screened. 

Methods of identifying antigen-binding molecule- 
expressing cells expressing an antigen-binding molecule 
of selected specificity other than the nitrocellulose 

10 filter overlay technique described above can be used. An 
important characteristic of any method is that it be 
useful to screen large numbers of different antibodies. 
With the nitrocellulose filter overlay technique, for 
example, if 300 dishes are prepared and 10* independent 

15 transformed host cells per dish are screened, and if, on 
average, each cell produces ten different antibody 
molecules, then 300 x 10 4 x 3 , or about 10 7 different 
antibodies can be screened at once. However, if the 
antibody molecules can be displayed on the cell surface, 
20 still larger numbers of cells can be screened using 
affinity matrices to pre-enrich for antigen-binding 
cells. There are immortal B cell lines, such as BCL^, 
which will express IgM both on the cell surface and as a 
secreted form (Granovicz, E.S., et r al. , J^Immunol. , 
25 125:976 (1980)). If such cells are infected by 

retroviral vectors containing the terminal Cm exons , the 
infected cells will likely produce both secreted and 
membrane bond forms of IgM (Webb, C.F., et al. . J . 
Immunol^, 143:3934-3939 (1989)). Still other methods can 
30 be used to detect antibody production. If the host cell 
is E. coli. a nitrocellulose overlay is possible, and 
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such methods have been frequently used to detect E. coll 
producing particular proteins (Sambrook, J., et al . , 
Molecular Cloning: A Laboratory Manual , Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, N.Y. 

05 (1989)). Other methods of detection are possible and one 
in particular, which involves the concept of "viral 
coating", is discussed below. 

Viral coating can be used as a means of identifying 
viruses encoding antigen-combining molecules. In this 

10 method, a viral vector is used to direct the synthesis of 
diverse antibody molecules. Upon lytic infection of host 
cells, and subsequent cell lysis, the virus becomes 
"coated 11 with the antibody product it directs. That is, 
the antibody molecule becomes physically linked to the 

15 outside of a mature virus particle, which can direct its 
synthesis. Methods for viral coating are described 
below. Viruses coated by antibody can be physically 
selected on the basis of their affinity to antigen which 
is attached to a solid support. The number of particles 

20 which can be screened using this approach is well in 

9 11 
excess of 10 and it is possible that 10 different 

antibody genes could be screened in this manner. In one 

embodiment, an affinity matrix containing antigen used to 

purify those viruses encoding antibody molecules with 

25 affinity to antigen and which coat the surface of the 
virus which encodes those antibodies is used. 

One method of viral coating is as follows: A 
diverse library of bacteriophage A encoding parts of 
antibody molecules that are expressed in infected £. coli 

30 and which retain the ability to bind antigens is created, 



WO 91/10737 



PCT/US91/00209 



•30- 

using known techniques (Orlandi, R. , et al. Prpc. Natl., 
Acad. Sci. PSA, 86:3833 (1989); Huse, W.D., etjl. , 
i£i£S££t 246:1275 (1989); Better, M. , et al,. Science, 
240:1041 (1988); Skerra, A. and A. Pluckthon, Science, 
05 1£0:1°3B (1988)). Bacteria infected with phage are 

embedded in a thin film of semisolid agar. Greater than 
10 infected bacteria may be plated in the presence of an 
excess of uninfected bacteria in a volume of 1 ml of agar 
and spread over a 10 cm surface. The agar contains 

10 monovalent antibody "A" (Parham, P, , In Hand book of 

Experimental Immunology: Immunochem. , Blackwell 

Scientific Publishers, Cambridge, MA, pp, 14.1-14.23 
(1986)), which can bind the X coat proteins and which has 
been chemically coupled to monovalent antibody "B n , which 

15 can bind an epitope on all viral directed antibody 

molecules. Monovalent antibodies are used to prevent the 
crosslinking of viral particles. Upon lytic burst, 
progeny phage particles become effectively cross linked 
to the antibody molecule they encode. Because lysis 

20 occurs in semisolid medium, in which diffusion is slow, 
cross linking between a given phage and the antibody 
encoded by another phage is minimized. A nitrocellulose 
filter (oj other protein binding filter) is prepared as 
an affinity matrix by adsorbing the desired antigen. The 

25 filter is then blocked so that no other proteins bind 

nonspecifically . The filter is overlayed upon the agar, 
and coated phage are allowed to bind to the antigen by 
way of their adherent antibody molecules. Filters are 
washed to remove nonspecifically bound phage. 

30 Specifically bound phage therefore represent phage 
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encoding antibodies with the desired specificity. These 
can now be propagated by reinfection of bacteria. 

Thus the present invention makes it possible to 
produce antigen-binding molecules which, like antibodies 
05 produced by presently-available techniques, bind to a 
selected antigen (i.e., having binding specif ity) . 
Antibodies produced as described can be used, for 
example, to detect and neutralize antigens and deliver 
molecules to antigenic sites. 

10 EXAMPLE,! Amplification of IgM Heavy g Chain^Variable 

Re gion DNA from mRNA 
IgM heavy chain variable DNA is amplified from mRNA 
by the procedure represented schematically in Figure 2. 
In Figure 2, Panel A depicts the relevant regions of the 

15 poly adenylated mRNA encoding the secreted form of the 
IgM heavy chain. In Panel A, S denotes the sequences 
encoding the signal peptide which causes the nascent 
peptide to cross the plasma membrane, a necessary step in 
the processing and secretion of the antibody. V, D and J 

20 derive from separate exons and together comprise the 

variable region. C H l t C H 2 , and C H 3 are the three constant 
domains of C/i . "Hinge" encodes the hinge region. C, B 
and Z are oligonucleotide PCR primers used in the 
amplification process. The only constraints on Primers B 

25 and 2 are that they are complementary to the mRNA , and 
occur in the order shown relative to C. Primer C, in 
addition to being complementary to mRNA, has an extra bit 
of sequence at its 5' end which allows the cloning of its 
PCR product. This is described below. Panel B depicts 

30 the reverse transcript DNA product of the mRNA primed by 
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oligonucleotide Z f with the addition of poly-dC by 
terminal transferase at the 3' end of the product. Panel 
C depicts the annealing of primer A to the reverse 
transcript DNA represented in Panel B. Primer A contains 

05 the restriction endonuclease site RE1, with additional 
DNA at its 5 r end. The constraints on the RE1 site are 
described in Example 2. Panel D depicts the final double 
stranded DNA PCR product made utilizing primers A and B. 
Panel E depicts the PCR product shown in Panel D annealed 

10 to Primer C. Panel F is a blow up of panel E showing the 

structure of primer C. Primer C consists of two parts: 

« 

a 3' part complementary to IgM heavy chain mRNA as shown, 
and a 5 # part which contains restriction site RE 2 and 
spacer. Constraints on RE2 are described in Example 2. 
15 Panel G depicts the final double stranded DNA PCR product 
utilizing Primers A and C and the product of the previous 
PCR (depicted in Panel D) as template. The S, V, D, J 
regions are again depicted. 

EXAMPLE 2 Construction of Heayy^ghain Framework Vector 

20 pFHC 

A he*vy chain framework vector, designated pFHC, is 
constructed, using known techniques (See Figure 3). It 
is useful for introducing antibody-encoding DNA into host 
cells, in which the DNA is expressed, resulting in 

25 antibody production. The circular plasmid (above) is 

depicted linearized (below) and its relevant components 

are shown. The neomycin antibiotic resistance gene 
R 

(neo ) is useful for selecting transformed animal cells 

( S amb rook, J . , et al . , Molecular Cloningj A Laboratory 

30 Manual , 2d Ed., Cold Spring Harbor Laboratory Press, Cold 
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Spring Harbor, NY (1989)). The bacterial replication 
origin and ampicillin antibiotic resistance genes, useful 
respectively, for replication in E. c oll and rendering 
coll resistant to anpicillin, can derive from any number 

05 of bacterial plasmids , including PBR322 (Sambrook, J . . e t 

Si- » Molecular Clonin p: A Laboratory Manual, 2d Ed. , 

Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 
NY (1989)). The enhancer, which derives from the 
intron between exons J and C R 1 of the Cp gene, derives 

10 from any one of the cloned C/i genes (Kawakami, T . , et 
al. f Nucl ei c Acids Research. 8:3933 (1980); Honjo, T., 
Ann. Rev. Immun ol.. 1:499 (1983)) and increases levels of 
transcription from antibody genes. LTR contains the 
viral promoter from the Moloney MLV retrovirus DNA 

15 (Mulligan , R . C . t Experim ental Manipulatio n of_Gene 

Expression, New York Academic Press, p. 155 (1983)). 
D represents the PCR primer described in the text, 
depicted in its 5' to 3' orientation. The only con- 
straints on D are its orientation, its complementarity to 

20 pFHC and its order relative to the RE1 and RE2 cloning 
sites. Preferably. D is within 100 nucleotides of RE1. 
The cDNA cloning site contains restriction endonuclease 
sites REl- r and RE2, separated by spacer DNA which allows 
their efficient cleavage. The constraints on RE1 and RE2 

25 are described below. The C/i exons, as described in the 
text and literature, direct the synthesis of IgM heavy 
chain. Only part of C H 1 is present, as described below. 
C H 3 is chosen to contain the Cjis region which specifies a 
secreted form of the heavy chain ((Kawakami, T . , et^al . , 

30 Nucleic_Acids Research. 8:3933 (1980); Honjo, T. , Anru 
5£^i52HS£l^. 1:499 (1983)). Finally, pFHC contains 
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poly A addition and termination sequences which can be 
derived from the C/i gene itself (Honjo, T. , Ann. Rev. 
Immunol., 1:499 (1983); Kawakami, T. , et al .,. Nucleic 
Acids Research, 8:3933 (1980)). One potential advantage 
of using the entire Cfi gene is that in some host cell 
systems, a membrane bound and secreted form of IgM may be 
expressed (Granowicz, E.S., et al . , J. Immunol . 125:976 
(1980)). 

The plasmid can be produced by combining the 
individual components, or nucleic acid segments, depicted 
in Figure 3, using PCR cassett assembly (See below). 
Because the entire nucleotide sequence of each component 
is defined, the entire nucleotide sequence of the plasma 
is defined. 

The constraints on RE1 are simple. It should be the 
sole cleavage site on the plasmid for its restriction 
endonuclease. The choice of RE1 can be made by computer 
based sequence analysis (Intelligenetics Suite, Release 
5:35, Intelligenetics). 

The constraints on RE 2 are more complex. First, it 
must be the sole cleavage site on the plasmid for its 
restriction endonuclease, as described for RE1 . 
Moreover, ithe RE2 site must be such that when the PCR 
product is inserted, a gene is thereby created which is 
25 capable of directing the synthesis of a complete IgM 
heavy chain. This limits the choices for RE2, but the 
choices available can be determined by computer based 
sequence analysis. The choices can be determined as 
follows. First, a list of restriction endonucleases that 
30 do not cleave pFHC is compiled (see Table 1). 
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TABLE 1 



Non-CuttltiR Enzymes for the Mouse Cn Gene 



05 



10 



15 



20 



25 



AatH 


Ahall 


Asel 


Avrll 


Bgll 


BspHI 


BSSHII 


BS ZD 1 


cial 


Dral 


EagI 


EcoRI 


EcoRV 


Fspl 


Hgal 


Hindi 


Hpal 


KphI 


Mlul 


Nael 


Narl 


Ndel 


NotI 


Nrul 


PaeR7I 


Pvul 


RsrII 


SacII 


Sail 


Seal 


Sfll 


SnaBI 


Spel 


SphI 


Sspl 


StuI 


Tthllll 


Xbal 


Xhol 


called the 


"rare non-cutters." 


Next, 



sequence of C^l is rewritten with "N" at the third 
position of each codon and entered into the computer. 
This is oalled the *N-doped sequence" (See Figure 4). 
Next, the rare non-cutters are surveyed by computer 
analysis for those which will cleave the N-doped 
sequence. The search program will show a possible 
restriction endonuclease site, assuming a match between N 
and the restriction endonuclease cutting site. For 
example, with 39 rare non-cutters, 22 will cleave the 
N-doped sequence of C/i C^l , many of them several times 
(see Table 2). In this table, "Def" means a definite cut 
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ite, of which there are none, because of the Ns . "Pos" 
eans a possible cleavage site at the indicated nucleo- 
ide position if N is chosen appropriately. "Y" 
ndlcates any pyrimidine , "R" indicates any purine and 
N" indicates any nucleotide. The nucleotide positions 
efer to coordinates represented in Figure 4. 
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TABLE 2 



05 



10 



15 



20 



25 



30 



35 



40 



45 



ENZYME 


RECOGNITION 


CUT SITE 




Aatll 


(GACGTC) • 


Def 


: none 








Pos 


: 250 


309 


Ahall 


(GRCGYC) 


Def 


: none 








Pos 


: 247 


306 


Avrll 


(CCTAGG) 


Def 


: none 








Pos 


: 204 




BspHI 


(TCATGA) 


Def 


: none 








Pos 


: 138 




fisshll 


(GCGCGC) 


Def 


: none 








Pos 


: 169 




EcoRI 


(GAATTC) 


Def 


: none 








Pos 


: 195 


334 


EcoRV 


(GATATC) 


Def 


: none 








Pos 


: 214 




Hgal 


(GACGCNNNNN) 


Def 


: none 






(NNNNNNNNNNGCGTC) 


Pos 


: 284 




Hindi 


(GTYRAC) 


Def 


: none 








Pos 


: 183 


220 


Hpal 


(GTTAAC) 


Def 


: none 








Pos 


: 220 




Kpnl 


(GGTACC) 


Def 


: none 








Pos 


: 408 




Nrul 


(TCGCGA) 


Def 


: none 








Pos 


: 174 


193 


PaeR7 


(CTCGAG) 


Def 


: none 








Pos 


: 190 


339 


Pvul 


(CGATCG) 


Def 


: none 








Pos 


178 




Seal 


(AGTACT) 


Def 


: none 








Pos 


209 


266 


Spel_\ 


(ACTAGT) 


Def 


. none 








Pos : 


131 


167 


SphI 


(GCATGC) 


Def : 


none 








Pos : 


338 




Sspl 


(AATATT) 


Def : 


none 








Pos : 


371 




StuI 


(AGGCCT) 


Def : 


none 








Pos : 


149 




Tthllll 


(GACNNNGTC) 


Def : 


none 








Pos : 


212 




Xbal 


(TCTAGA) 


Def : 


none 








Pos : 


338 




Xhol 


(CTCGAG) 


Def : 


none 








Pos : 


190 


339 



303 



284 
359 
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Most of these cleavage sites (about 60%) are compatible 

with the amino acids specified by C„l. Therefore, it is 

xi 

possible to mutate C^l to create a unique site for such 
an enzyme without altering the amino acid sequence 
05 incoded by Cgl. One sequence which illustrates this is 
shown below: 

1) ala met gly cys leu ala arg asp... 

2) ...GCC ATG GGC TGC CTA GCC CGG GAC... 

3) ...GCC ATG GGC TGC, CTA GCG CGC GAC... 

BssHII 

Line 1 represents part of the actual amino acid 
sequence specified by the mouse Cp C R 1 gene region, and 
line 2 is the actual nucleotide sequence. By changing 
the sequence to the indicated nucleotides underlined on 
line 3, a cleavage site for the rare non-cutter BssHII is 
created. The new sequence (containing the BssHII site) 
GCG CGC still encodes the identical amino acid sequence. 
Therefore the sequence of the primer C is chosen to be 
the complement of line 3, and RE2 is the BssHII site. 
Such a primer will function in the PCR and vector 
construction as desired. Other examples are possible, 
and the same process can be used in designing vectors and 
primers for cloning light chain variable regions. 

The choice for primer C puts a constraint on pFHC. 
In the example shown, the C R 1 region contained on pFHC 
must begin at its 5' end with the mutant sequence GCG 
CGC. Such mutant fragments can be readily made by the 
process of PCR cassette assembly described below. 
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The process of PCR cassette assembly is a method of 
constructing plasmid molecules (in this case the plasmid 
pFHC) from fragments of DNA of known nucleotide sequence. 
One first compiles a list of restriction endonucleases 

05 that do not cleave any of the fragments. Each fragment 
is then Individually PCR amplified using synthesized 
oligonucleotide primers complementary to the terminal 
sequences of the fragment. These primers are synthesized 
to contain on their 5' ends restriction endonuclease 

10 cleavage sites from the compiled list. Thus, each PCR 
product can be so designed that each fragment can be 
assembled one by one into a larger plasmid structure by 
cleavage and ligation and transformation into E. coli. 
Using this method, it is also possible to make minor 

15 modifications to modify the terminal sequence of the 

fragment being amplified. This is done by altering the 
PCR primer slightly so that a mismatch occurs.- In this 
way it is possible to amplify the Cji gene starting 
precisely from the desired point in C H 1 (as determined by 

20 oligo C above) and creating the RE2 endonuclease cleavage 
site. 



25 



WO 91/10737 



PCT/US91/00209 



05 



10 



15 



20 



-40- 
CLAIM S 

1. An in_Zl£r° process for synthesizing DNA encoding a 
family of antigen- combining proteins, comprising the 
steps of: 

a) obtaining DNA containing genes encoding 
antigen-combining proteins; 

b) combining the DNA containing genes encoding 
antigen-combining proteins with sequence 
specific primers which are oligonucleotides 
homologous to conserved regions of the genes; 
and 

c) performing sequence specific gene 
amplification. 

2. DNA encoding a family of antigen- combining proteins 
produced by the process of Claim 1. 

3. The process of Claim 1 wherein sequence specific 
gene amplification is performed by the polymerase 
chain reaction. 

« 

4. The process of Claim 3 wherein the sequence specific 
primers are bidirectional. 

5. The process of Claim 3 wherein the sequence specific 
primers are nested unidirectional primers. 

6. The process of Claim 1 wherein the antigen- combining 
proteins are immunoglobulins. 
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7. The process of Claim 6 wherein the immunoglobulins 
are selected from the group consisting of heavy 
chains and light chains. 

8. The process of Claim 7 wherein the heavy chains are 
05 ii chains, 

9. The process of Claim 1 wherein the DNA containing 
genes encoding antigen- combining proteins is cDNA of 
RNA from antibody-producing cells. 

10. The process of Claim 1 wherein the DNA containing 

10 genes encoding antigen-combining proteins is genomic 

DNA from antibody-producing cells. 

11. The process of Claim 8 wherein the antigen-combining 
proteins are of mammalian origin. 

12. The process of Claim 1 wherein the primers are 

15 oligonucleotides homologous to conserved regions of 

the constant regions of immunoglobulin genes. 

13. The process of Claim 1 wherein the primers are 
oligonucleotides homologous to the conserved regions 
of the variable regions of immunoglobulin genes. 

20 14. The process of Claim 1 wherein the primers contain 
at least one restriction endonuclease cloning site. 

15. The process of Claim 1 wherein the primers are 
selected from the group consisting of 
oligonucleotide B of Figure 2 and oligonucleotide C 
25 of Figure 2. 
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16. A method of creating a diverse starter library of 

DNAs encoding families of antigen-combining proteins 
comprising cloning the product of Claim 1 into an 
appropriate vector. 

05 17. A diverse starter library of DNAs encoding families 
of antigen-combining proteins produced by the method 
of Claim 14. 

18. The method of Claim 16 wherein the vector is a 
prokaryotic vector or a eukaryotic vector. 

10 19. The method of Claim 16 wherein the vector is a viral 
vector or a retroviral vector. 

20. The method of Claim 16 wherein the vector is a 
plasmid. 

21. The method of Claim 20 wherein the plasmid is 

15 selected from the group consisting of pFHC and pLHC . 

22. The method of Claim 16 wherein the vector is 
selected from the group consisting of expression 
vectors and cloning vectors. 

23. The method of Claim 22 wherein the expression vector 
20 is appropriate for expression of the variable region 

of an antigen-combining protein as a chimeric 
molecule in register with a framework protein. 

24. The method of Claim 23 wherein the framework protein 
is an immunoglobulin. 
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25. The method of Claim 24 wherein the immunoglobulin is 
all or a portion of the constant region of the /* 
heavy chain, 

26. The Method of Claim 16 further comprising creating a 
05 collection of viral particles from viral vector- 
based libraries of DNA encoding antigen-combining 
proteins by the process of introducing viral vectors 
into host cells in which they replicate and form 
viral particles. 

10 27. A method of producing a high diversity library of 

DNA encoding families of antigen- combining proteins 
comprising mutagenizing the product of Claim 16. 

28. A high diversity library of DNA encoding families of 
antigen-combining proteins produced by the method of 

15 Claim 27. 

29. The method of Claim 27 wherein mutagenizing is 
carried out by random chemical mutagenesis. 

30. The ^method of Claim 27 wherein mutagenizing is 
carried out by performing the polymerase chain 

20 reaction under limiting nucleotide conditions. 

31. The method of Claim 27 wherein mutagenizing is 
carried out in such a manner that mutagenesis is 
limited to DNA encoding variable regions of the 
antigen-combining protein. 

25 32. A process of producing a diverse population of host 
cells which comprises introducing into host cells 
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DNA of the starter library or high diversity 
libraries of antigen-combining proteins. 

33. Host cells produced by the method of Claim 32. 

34. The process of Claim 32 vherein the host cells are 
05 prokaryotic. 

35. The process of Claim 32 vherein the host cells are 
eukaryotic . 



10 



36. The process of Claim 35 Vherein the host cells are 
selected from the group consisting of immortalized 
cultured mammalian cells. 



37. The process of Claim 36 vherein the immortalized 

cultured mammalian cells are selected from the group 
consisting of myelomas and plasmacytomas. 



15 



38. The process of Claim 32 vherein the libraries 

encoding families of antigen- combining proteins are 
introduced into host cells by a method selected from 
the" group consisting of: electroporation, calcium 
phosphate coprecipitation , protoplast fusion, viral 
infection, and cell fusion. 



20 39. The process of Claim 32 vherein the libraries of 

DNAs encoding families of antigen-combining proteins 
is contained In an expression vector. 



25 



40. The process of Claim 32 vherein the DNAs encoding 
families of ant igen- combining proteins encode 
antigen-combining proteins selected from the group 
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consisting of immunoglobulin heavy chain variable 
regions or immunoglobulin light chain variable 
regions. 

41. The process of Claim 40 wherein DNAs encoding 
05 immunoglobulin heavy chain variable regions are 

Introduced simultaneously with or sequentially to 
DNAs encoding immunoglobulin light chain variable 
regions. 

42. The method of Claim 32 further comprising 

10 identifying cells which produce antigen- combining 

molecules of selected specificity. 

43. The method of Claim 42 wherein identifying of cells 
which produce antigen- combining molecules of 
selected specificity is carried out by assaying 

15 cellular supernatants for antigen- combining 

activity. 

44. The method of Claim 42 wherein identifying of cells 
which produce antigen-combining molecules of 
selected specificity is carried out by a 

20 nitrocellulose filter overlay technique. 



45. The method of Claim 44 wherein cells producing 
antigen-combining molecules of selected specificity 
are enriched for cells producing antigen-combining 
molecules on their surface by affinity matrix 

25 chromatography . 

46. Cells produced by the method of Claim 42. 
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47. Antigen-combining molecules produced by cells of 
Claim 42. 

48. DNAs encoding immunoglobulin heavy chain variable 
regions or immunoglobulin light chain variable 

05 regions, present in cells of Claim 42. 

49. Viruses produced by the method of Claim 26. 

50. A method of isolating viruses of Claim 49 encoding 
antigen- combining molecules of selected specificity, 
comprising the steps of: 

10 *) infecting host cells with an appropriate virus 

containing DNA encoding antigen-combining molecules; 

b) coating the virus with antigen- combining 
molecules which the virus encodes; and 

c) subjecting the product of step (b) to 
affinity-matrix selection, to separate the virus 
according to the antigen-combining molecules they 
contain:* 

51. Viruses produced by the method of Claim 50. 

A 
»» ' 

52. Antigen- combining molecules encoded by viruses of 
20 Claim 51. 



15 
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